Thinking Outside the Box: 
Effects of Modes Larger than the Survey on Matter Power Spectrum Covariance 
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Accurate power spectrum (or correlation function) covariance matrices are a crucial requirement 
for cosmological parameter estimation from large scale structure surveys. In order to minimize 
reliance on computationally expensive mock catalogs, it is important to have a solid analytic un- 
' derstanding of the different components that make up a covariance matrix. Considering the matter 

power spectrum covariance matrix, it has recently been found that there is a potentially dominant 
effect on mildly non-linear scales due to power in modes of size equal to and larger than the survey 
volume. This heat coupling effect has been derived analytically in perturbation theory and while it 
has been tested with simulations, some questions remain unanswered. Moreover, there is an addi- 

■ tional effect of these large modes, which has so far not been included in analytic studies, namely the 
effect on the estimated average density which enters the power spectrum estimate. In this article, we 
work out analytic, perturbation theory based expressions including both the beat coupling and this 
local average effect and we show that while, when isolated, beat coupling indeed causes large excess 

Q ■ covariance in agreement with the literature, in a realistic scenario this is compensated almost en- 

^ I tirely by the local average effect, leaving only ~ 10% of the excess. We test our analytic expressions 

C/5 , by comparison to a suite of large N-body simulations, using both full simulation boxes and subboxes 

■ thereof to study CEises without beat coupling, with beat coupling and with both beat coupling and 
the local average effect. For the variances, we find excellent agreement with the analytic expressions 
for k < 0.2/iMpc^^ at z = 0.5, while the correlation coefficients agree to beyond k = 0.4/iMpc~^. 

■ As expected, the range of agreement increases towards higher redshift and decreases slightly to- 

■ wards z — 0. We finish by including the large-mode effects in a full covariance matrix description 
, for arbitrary survey geometry and confirming its validity using simulations. This may be useful as 

1^-^ . a stepping stone towards building an actual galaxy (or other tracer's) power spectrum covariance 

■ matrix. 

I. INTRODUCTION 

Galaxy surveys (and surveys of other dark matter tracers) are an important tool for constraining cosmological 
parameters and the tracer's power spectrum (or its Fourier transform, the correlation function) is the most valuable 
observable. It can be used to measure the scale of baryon acoustic oscillations (BAO, see e.g. [1-4]), which is a 
^ \ particularly robust feature and a strong probe of dark energy through the measurement's dependence on the expansion 
history, but it is also common to use the full shape of the power spectrum to constrain cosmology (e.g. [5, 6]). In 
order for these measurements to be useful for cosmology however, it is crucial to have an accurate estimate of the 
observable's covariance matrix. As surveys get larger, calculating such a matrix directly from simulations becomes ever 
more challenging. In fact, large numbers of mock catalogs are required to estimate covariance matrices directly from 
simulations and it is computationally very costly to create such numbers with the required volume and resolution. 
Moreover, the covariance matrix is cosmology dependent so ideally one would either be able to generate a large 
number of matrices for different cosmologies, or show that this cosmology dependence can be safely neglected. This 
requirement increases the number of mocks needed in the brute force method even further. It is therefore important 
to develop a good analytical understanding in order to eventually rely less on simulations. 

While the density of tracers may have a non-trivial relation with the dark matter density, a good first step towards 
understanding the true covariance matrix is to study that of the dark matter power spectrum. The two main 
complications in understanding the dark matter matrix are the mode mixing due to the finite survey volume and the 
non-Gaussian nature of the matter overdensity field on small scales (fc ^ 0.1/iMpc~^) due to non-linear evolution. 
While the former is relatively straightforward to quantify ([7]), the latter effects are the subject of a large number of 
studies [8-20], involving N-body simulations, perturbation theory and the halo model. For covarianccs on non-linear 
scales, an important (but often overlooked) role is played by modes of wavelength comparable to and larger than the 
survey size. In particular, [21, 22] found that the interplay of mode mixing by the survey window with non- linear 
correlations between a pair of small-scale modes and a large mode, causes a covariance contribution proportional to 
the power in super-survey modes, and that, in fact, this is the dominant contribution to the covariance matrix on 
small scales. The added covariance affects both the variances and the off-diagonal covariances. This effect is usually 
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called beat coupling (see also [13, 23-25]). A second consequence of the presence of modes larger than the survey 
is that the average density (which is needed to construct the overdensity) estimated from the survey has a variance 
itself, leading to an additional covariance contribution proportional to the large scale power. We will refer to this as 
the local average effect. 

In the standard approach to estimating covarianccs from N-body simulations, the power spectrum is calculated 
using the full, periodic simulation volume. However, this completely misses the large effects discussed above because 
there are no modes larger than the "survey" volume. In order to capture these effects, one thus needs to consider 
either a subvolume of a (much) larger simulation, or implement a varying zero mode ([12]) in the simulations. While 
some such studies have been performed (see e.g. [24, 25]), a systematic study, leading to a complete and well tested 
analytic description has until now been lacking. 

In this article, we aim to present such a study. Wc use a set of large (L = 2400/i~^Mpc) N-body simulations and 
study the aforementioned effects of large modes by considering significantly smaller subboxes (L — 600/i~^Mpc), thus 
imitating the real world scenario of a finite survey embedded in an infinite universe. We build an analytic model for 
the covariance matrix, properly including the effect of the window function and describing non-linear effects using 
perturbation theory. This description includes the beat coupling model from [21], but adds to this an analytic estimate 
of the local average effect. The latter effect needs to be included when building a covariance matrix for a realistically 
estimated power spectrum and has to our knowledge not been calculated analytically previously. 

We compare our model to simulations at five redshifts in the range z = — 2 and generally find excellent agreement 
until well into the non-linear regime. Since wc test several combinations of covariance matrix contributions individually 
by employing different approaches to estimating spectra from simulations, we are confident that our theoretical 
understanding of the dark matter covariance is correct and that we have captured the main effects. 

One of our main results is that, while beat coupling indeed causes excess covariance which soon dominates, the 
contribution from the local average effect cancels most of this out, leaving only a reduced beat coupling contribution 
of about 10% of the original. 

In addition to the effects on non-linear scales discussed above, the mode mixing due to the window function also has 
a well known effect on linear scales, correlating neighboring power spectrum estimators and reducing the variances. 
This effect needs to be taken into account for a realistic survey. We calculate these covarianccs directly using analytic 
expressions from [7] and show good agreement with simulations. Thus, while the main goal of this paper is to study 
the beat coupling and local average effects related to super-survey modes, we also provide a complete expression 
that can be used to calculate a (dark matter) covariance matrix for an arbitrary survey geometry and can serve as 
a stepping stone for calculating a galaxy (or other dark matter tracer) spectrum covariance matrix. We also study 
the cosmology dependence of the covariance matrix and find that for cosmological parameter variations relevant to 
current cosmological constraints, the covariance matrix undergoes changes at the 30% level. 

The outline of this article is as follows. Wc explain the analytic description of the covariance matrix in section II, 
using a simple formalism apt for the description of simulation results. This will lead to three expressions that can 
be tested against simulations (without beat coupling, with beat coupling, and with beat coupling and local average 
effect). We then describe our N-body simulations in section III and compare the results to theory in section IV. 
While for the main results in this paper, we stick to a simple description in terms of a discrete set of modes, we 
discuss a more rigorous and complete treatment of the window function in section V, leading to expressions that can 
be applied to arbitrary survey geometry. We conclude and summarize our work in section VI. Finally, we use our 
analytic expressions to bricfiy discuss the cosmology dependence of the covariance matrix in the Appendix. 

As this article is rather long and technical, we provide some recommendations for a reader on a tight schedule. On 
a first reading, this reader may want to skip the mathematical discussion in section II and jump straight to section 
HE, where the main equations are summarized. Moreover, in section V, one could jump to Eq. (48), which presents 
the final covariance matrix expression for general geometry, and then focus on the results in Figs 6-9, where the green 
points and lines show the covariance matrix corresponding to said equation. 

II. COVARIANCE MATRIX FORMALISM 

In this section, wc work out the formalism for calculating a covariance matrix for a power spectrum measured from 
a cubic volume with uniform selection function, i.e. the geometry of N-body simulations. Wc describe the density field 
in terms of a discrete set of modes, which significantly simplifies notation. We will provide a more general description, 
which can be applied to an arbitrary survey geometry (following [7]), in section V. The more technical discussion 
there also serves to motivate the simpler approach taken in this section. 
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A. N-body Simulation Geometry (Cubic Box) 

We consider a cubic subvolumc V — oi the universe (which later in this article will be modeled by taking a 
subvolume of a much larger periodic simulation box). The matter overdensity field (5(x) = (/9(x) — p)/p, where p(x.) 
is the matter density and p its mean, can be expanded as 

(2^e-*-W- (1) 

To significantly simplify notation, we choose to describe the density field in the volume V in terms of discrete Fourier 
modes in analogy to the usual description of the density field in a periodic box, i.e. we effectively describe the field 
in the subvolume by 

(S(x) ^^e-'''"(5k, k, = ^n, for = 0,±1,.... (2) 

k 

To be more exact, we define the discrete modes as 

6^ = ^ J^d'^e'^'-^Si^). (3) 

In other words, is equal to the FKP estimator ([7]) F(k) (up to a factor V~^) applied to the volume V, and is 
therefore a weighted average over a range of continuum Fourier modes. In particular, while there is no true k = 
mode, the subvolume sees an effective zero mode (see also [12]) 6o = Jy d'^x5{x.) which gets its main contribution 
from continuum Fourier modes |k| < 2tt/L. Note that Eq. (2) is not to be taken literally, as the field is not actually 
periodic with respect to the volume V. We discuss the effect of taking a subvolume more rigorously in section V. 
The statistics of the discrete overdensity modes can be characterized by a sequence of connected n— point functions, 

(<5k<5k') 
{Sk Sk' (5k" )c 

(^k^k' ^k" Sk"')c 

where we have defined the power spectrum P^, bispectrum i?k,k',k" an the trispectrum Tk,k',k",k"- Note that these 
are technically weighted averages of the true, continuum power, bi- and trispectrum, for example (see also section V) 

Po^V{\So\')=V J -^Pik) Yl foiLh/2), (4) 

i—x^y^x 

where jo{x) = smx/x is the zeroth spherical Bessel function. 
The power of an individual mode can be estimated as 

Pk^V\5k?, (5) 

such that (Pk) = Pk as desired, from which one can define a bin averaged estimator in order to maximize signal to 
noise, 

P.-^E^- (6) 

* kGi 

Here, the sum is over all k = ^ n, with integer components n^, such that k lies in some small range defining the z-th 

bin. Ni is the number of modes in i. The expectation value Pi = (Pi) ~ Pk^, with ki a typical mode inside the bin. 
We are interested in the covariance matrix of this bin averaged estimator. 
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Starting from the covariances in the individual mode estimators, 

(<5Pk Spy.,) = Pi + + \ Tk,-k,k',-k', (8) 

we obtain 




where we have introduced the shorthand notation Tij for the bin averaged trispectrum. The term proportional to the 
power spectrum squared is diagonal and given by simply counting modes because the binning in fc-spacc only includes 
pairs of estimators that have zero covariance. As we will discuss in detail in section V, this is not in general the case 
as the window function may correlate different bins. 



B. N-body Simulation Geometry: Perturbation Theory 

The power spectrum appearing in the first term on the right hand side of Eq. (9) is the full non-linear power 
spectrum, which can be estimated directly from simulations. The trispectrum is harder to get from simulations (but 
see [19]), but can be modeled using a variety of perturbation theory schemes or using the halo model. In this work, 
we will use the simple framework of (Eulerian) standard perturbation theory (SPT; [8, 26-28]). Anticipating angle 
averaging, i.e. modulo transformations k — k and k' o — k', the leading order trispectrum for the configuration of 
interest is given as (see^ [8, 27]) 

rk,-k,k',-k' = 12 P't Pl^ [J^3(k, -k, k')P^ + (k o k')] 

+8 ^|k-k'| [i^2(k-k',k')/t.i" + (kok')]' 

+16 Pi'" P^.i"P^'"P2(-0,k)P2(k',0), (10) 

where and F3 are the kernels for the second and third order contributions to the density field, as given in the 
appendix of [28]. We can now distinguish two types of terms. The first two lines represent the "standard" trispectrum 
contributions, that have always been included in PT studies of the trispectrum (see e.g. [8]). When no modes larger 
than the survey are present, like in the artificial case of a power spectrum estimated from the full volume of a periodic 
(N-body simulation) box, these are the only terms entering the trispectrum, giving 

%-]^.Y.^.Y. [^"" [^3(k, -k, V)Pf^ + (k o k')] + 8 [F2(k - k', k') + (k o k')] \ (11) 

* kei ^ k'ej 

We will evaluate T?- numerically as no further analytic simplifications are possible. 



C. The Beat Coupling Effect 



The third line in Eq. (10) is the beat coupling contribution, the importance of which has been realized only more 
recently ([13, 21-25]). Our notation here requires some explanation (see also section V). Consistent with our discussion 
in the previous section, quantities evaluated at k = should really be interpreted in terms of an effective zero mode 
arising from contributions at fc < 27r/L, i.e. Pg'" = V (|(5o™|^) ~ P''"(fc ^ t^/L), generated by continuum contributions 
r(k +e,-k + e', k' + e", -k' + e'") with e + e' ^ -(e" + e'") - n/L. Moreover, F2, given by 

^ „ , N 5 ki • ko / fci fc2 \ 2 , , , , 

F2i^^,k.)^- + ^[^^+fJ+-i^.,■^..)^ (12) 

is at first sight not well defined if one of the arguments equals 0. P2(— 0, k) (and the other affected term) should thus 
be interpreted limit 

P2(-0,k)= J ^P2(~e,k), (13) 



^ Note that the factor 16 in Eq. (7) of [8] should be a factor 8. 
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with |e| tt/L (see section V for a more rigorous treatment of the beat coupHng term with the same result). The 
integral over the direction of e makes this quantity well defined. 
Using the fact that 

11, 

the beat coupling term can be angle (or bin) averaged analytically so that we end up with 

+ P^Pi^Pt- (15) 

Physically, the presence of the beat coupling term is an interesting interplay of mode mixing due to the window 
functions with correlations between pairs of non- linear modes and one larger mode (see [21]). Due to the finite volume 
from which the power spectrum is measured, the estimator P(k) really consists of a weighted average of pairs of 
density modes (5(k + e) (5(— (k + e')), with |e| of order of the fundamental mode 2tt / L. For k in the non-linear regime, 
such pairs correlate with the large scale perturbation 5{e' — e), which in turn causes the covariance between the power 
spectrum estimators to be proportional to the power in these large modes. In our description, this is captured by 
the power in the effective zero mode. Note that the beat coupling creates an excess both in the variance and in the 
off-diagonal elements of the covariance matrix. 



D. The Local Average Effect 



While the beat coupling term derived in the previous section adds significant covariance on non-linear scales, there is 
an additional effect coupling small scale covariance to power in the zero-mode that plays a role in the power spectrum 
estimated from an actual survey. This second effect is caused by the fact that to obtain the overdensity (5, one needs 
an estimate of the average density p. In a realistic survey, one does not know the true average number density of, say, 
galaxies, but instead has to rely on an estimate of the average density within the survey volume, which is modulated 
by the zero mode 5q. This results in a decrease in covariance, partially canceling the beat coupling effect^. We will 
refer to this contribution as the local average effect. 

The local average effect causes the true overdensity estimator to be given by 



5k 



l + <5c 



(16) 



Since p also appears in the numerator of 5, there technically is also a 5o contribution there. However, the Fourier 
transform of the zero mode for non-zero wave vector k vanishes so this term can be omitted. Eq. (16) can be expanded 
in powers of 5o to derive expressions for the n-point functions to the desired order in perturbation theory. For example, 
to next-to-leading order, the expectation value of the power spectrum estimator becomes 

(A) = 

= V (<5k <^-k (1 - 2<5o + i5l + {51))) 

= Pk- 2^^^ +3^p, + o {Pyv'-) . (17) 
The expectation value of the angle-averaged estimator is then 

(h) = - II P>^, Po + ^Pk. Po + O {pW) 

= (l - I y ) + O {pW) . (18) 

Hence, the power spectrum receives a small bias due to the local average effect. The relative correction is < 10""' for 
a lh~^Gpc^ survey so it can be safely ignored for a realistic survey. Here, we have used that the bispectrum ([13]) 

5k,k',k" = 2 Pfc Pk> F2(k, k') + (cyclic) (19) 



^ The difference between using the true and the focal average was afso commented on brieffy in [29] 
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and we have applied the identity (14) to carry out the angle averaging. 

Using the same approach for the covariance matrix, one gets to next to leading order 

C,^^{P.,P,)-{P.){P,) = ^Sf^ + f + 16[^^j Pj};^Pl!^fj^-l^Pj}^Pl!^fj^, (20) 

where we have ignored a small relative correction to the diagonal term of order Pq'^^/V. The local average effect thus 
introduces a term of the same form as the beat coupling term from the previous section, proportional to Pk^ Pk Po/V, 
but with opposite sign. Comparing the coefficients, one finds that the two effects almost entirely cancel out, leaving 
only a small positive coefficient, 16 (^j) — ^ « 1.5 ~ 10% of the original beat coupling coefficient. Note that the 
local average effect to this order consists of contributions from the bispectrum and trispectrum, but also from terms 
that would even be there had the field remained completely Gaussian (but that happen to be of the same order as 
the leading non-Gaussian corrections). 

Eq. (20) thus gives us an expression for the most realistic case we will consider, where modes larger than the survey 
are present and the spectrum is estimated using the local average in the survey volume. 



E. Theory Summary and Outlook 

In the remainder of this article, we will use N-body simulations to test the expressions derived above. We will 
consider three different cases which allow us to separately constrain different combinations of contributions to the 
total covariance: 

• Case 1: periodic box 

The spectrum is estimated from the full, periodic simulation volume. In this case, there is no beat coupling nor 
a local average effect and the prediction for the covariance is to leading order 



C^.-^^^^ + f- (21) 



We obtain the (non-linear) power spectrum appearing above by applying the Halofit"^ prescription [30] to the 
linear power spectrum calculated using CAMB [31]. The bin-averaged trispectrum T^^- is given by Eq. (11), 
where we make one modification. Instead of using the linear power spectra, we use the non-linear spectra. This 
is consistent to the desired order in perturbation theory and turns out to slightly improve the accuracy of the 
model on strongly non-linear scales. 

• Case 2: subbox of periodic box 

Since modes larger than the "survey volume" are now present, there is a beat coupling effect. The leading order 
covariance prediction is 

Note that also for the additional trispectrum terms, we choose to use the non-linear power spectrum as opposed 
to the linear one. 

• Case 3: subbox of periodic box, using subbox mean 

This is the most realistic case, where not only there are modes larger than the survey, but the overdensity 
S = (/9(x) — p)/p is calculated in terms of the average density p of the subbox, as opposed to the "true" average 
density of the full box. The leading order part of the covariance matrix is given by 

^'^1 " '^u- + ^ + 441 V ■ ^ ' 

In the next section, we first describe the details of our simulations. Wc will discuss the results of our comparison 
in section IV. 



^ We have chocked that the difference between using the Halofit spectrum and the average simulated spectrum is small for k < 0.4/i/Mpc. 
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III. N-BODY SIMULATIONS 



In order to test our analytic predictions we use a large suite of N-body simulations. We have two sets of simulations 
consisting of 160 runs of a 2400ft.^^Mpc box with 768'^ particles and 1024 runs of a 600ft.^^Mpc box with 192"^ particles. 
The initial conditions of the 160 and 1024 simulations were set up using different realizations of a Gaussian random 
field with the power spectrum given by CAMB. We adopted a flat ACDM cosmology consistent with the current 
observational constraints [32]. The cosmological parameters are the present-day matter fraction r2,„ = 0.27, Hubble 
constant h = 0.7, baryon fraction fthh^ = 0.023, spectral index = 0.95, and present-day normalization erg = 0.7913. 

The particles were displaced from their initial grid points according to second-order Lagrangian perturbation theory 
using an initial redshift Zi = 19. 

The simulations were performed with the Trec-PM code Gadgct-2 [33] taking only the gravitational force into 
account. We applied a force softening of 70/i~^kpc and used a particle mesh of 2048'^ and 512"^ for the 2400/i~^Mpc 
and 600ft.~^Mpc runs, respectively. Using a much higher resolution simulation, we checked that with these simulation 
settings the power spectrum derived from the simulation data is accurate at the 1% level up to fc < 0.2/iMpc~^ and 
remains accurate within 4% up to fc < 0.4/iMpc~^ at all redshifts. 

To compute the power spectrum from the simulation data, we assign the particles using the cloud-in-cell (CIC) 
scheme to a regular grid with a fixed grid spacing of 1.5625/i~^Mpc in all cases (full box, subbox, subbox with 
zero-padding). Hence, the Nyquist frequency of the grid is the same in all cases, ^Ny ~ 2/i~^Mpc, which is 5 times 
larger than the scales we consider in this paper k < OAhMpc'^ . Therefore we expect the effects due to finite grid 
size (smearing and aliasing effects) to be very small. Nevertheless, we do correct for the smearing due to the CIC 
assignment [34]. 

The simulations and the power spectrum computation needed about 300,000 CPU hours on our in- house cluster 
Hipatia at ICCUB. 



We now compare the simulation results to the analytic predictions summarized in section HE. For the covariances 
of the full periodic box (Case 1), we compare to the 1024 simulations of the 600/i~^Mpc cubed volume and to the 
160 simulations of the 2400/i~^Mpc cubed volume. To study the subbox Cases 2 & 3, we divide the L = 2400/i~^Mpc 
volume into 64 smaller boxes with side L/4 = 600/i~^Mpc each, which provides us with 160 x 64 = 10, 240 different 
subbox realizations. 

For each volume, we estimate the power spectrum Pi in isotropic bins of width Afc = 0.01/iMpc~^ in the range 
fc = — 0.4:hMpc~^ . We then estimate the covariance matrix for the full box, or for a particular subbox of it, by 



where the sum runs over all Nr simulation realizations and (Pi) is the average power spectrum over simulations. In 
the subbox case, we then improve the accuracy by averaging the subbox covariance estimate in Eq. (24) over all 64 
subboxes. Note that, since the subboxes living in the same simulation volume are not independent, it would be wrong 
to directly apply Eq. (24) to all Nr = 10, 240 subboxes. 

Since the non-linear effects discussed in section II become stronger with time, it is interesting to consider the 
covariance matrix for a range of redshifts from z = 2 — 0. We expect our analytic predictions to have the largest range 
of validity at z = 2, as non-linear effects there are smallest. 

In Fig. 1, we show results for the variances in the power spectrum, normalized by the variance based on mode 
counting and the linear power spectrum. Case 1 is tested by both the full L = 2400/i^^Mpc box case (black dots - 
N-body results, black dashed - analytic) and the full L = 600/i~^Mpc box case (red). The blue dots and dashed lines 
test Case 2 and the green ones test Case 3. For all cases and at all redshifts, we find good agreement between theory 
and simulation for bins up to a maximum value fcmax that lies in the non-linear regime. The values for fcmax are 
reasonable given the range of scales over which perturbation theory is expected to be applicable, femax ~ 0.4/iMpc~^ 
at z = 2 to fcinax ^ 0.15 - 0.2/iMpc"^ at z = 0. 

We now comment on the three scenarios of interest individually. 

• Case 1: periodic box 

The L = 600/i~^Mpc and L ~ 2400ft-^^Mpc full-box simulation results agree very well with each other, as 
expected because they are both described by Case 1 and have no beat coupling. They also agree well with the 
analytic prediction. This confirms that excess covariance is not simply caused by the mere presence of large 



IV. RESULTS 
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FIG. 1: Power spectrum variance relative to variance based on linear spectrum and mode counting {2P^i^^/Nk with Nk the 
number of modes per bin) for redshifts 2 = — 2. The dots are results from N-body simulations and the (dashed) lines theory 
predictions from Eqs (21)-(23), using perturbation theory (PT). The results in black and red represent the case where no modes 
larger than the survey are present (but other non-linear contributions are included). The blue results represent the case where 
these modes are present and thus display large excess covariance due to beat coupling. In the most realistic case (green), the 
large modes also affect the power spectrum through the estimated average density and this local average effect reduces the 
excess covariance by ~ 90%. 
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FIG. 2: Correlation coefficients pij = Cij/\/ Cu Cjj of i-th bin at k on x-axis, relative to j-th bin at fc = 0.04 — 0.05/iMpc~^. 
Same comparisons and color coding as in Fig. 1. The beat coupling causes a significant increase in the off diagonal correlations 
(blue), but this is again undone by the local average effect (green). The effect is described well by Eqs (21)-(23). 
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FIG. 3: As in Fig. 2, but here correlation coefficients relative to bin at fc = 0.09 — 0.1/iMpc ^ are shown. 
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FIG. 4: As in Fig. 2, but here correlation coefficients relative to bin at fc = 0.14 — 0.15/tMpc ^ are shown. 
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modes. The following two cases show that what matters is the presence of modes larger than the "survey" 
volume. 

• Case 2: subbox of periodic box 

Beat coupling introduces a large amount of excess variance and is well described by perturbation theory (for 
the range of scales discussed above). 

• Case 3: subbox of periodic box, using subbox mean 

As expected from theory, the local average effect undoes most of the excess variance due to beat coupling. Given 
the noise in the simulation results, the Case 3 simulation results do not distinguish with much significance the 
Case 3 theory prediction from the Case 1 prediction (but certainly from the Case 2 prediction). However, the 
Case 3 simulation results do clearly display a larger variance than the Case 1 results at large k, looking consistent 
with having ~ 10% of the beat coupling excess variance. 

In Figs. 2 - 4, wc quantify the off diagonal correlations by plotting the correlation coefficients between bins, 

P^. = -7==^ (25) 

where we keep one bin fixed and let the value of k corresponding to the other one vary on the horizontal axis. The 
fixed bins are at fc w 0.05/iMpc~^ (Fig. 2), k w 0.1/iMpc"^ (Fig. 3) and k w 0.15/iMpc"^ (Fig. 4). 

The conclusions for the correlation coefficients are similar to those drawn about the variances and again the analytic 
and numerical results agree well. In fact, for Cases 2 & 3, there now is good agreement up to fc = 0.4ft,Mpc~^ for all 
rcdshifts. We do note that the correlations relative to the k « 0.05/iMpc~^ bin appear to be overestimated somewhat 
by the analytic expressions. However, this is because the variance from simulations in the k = 0.04 — 0.05ft.Mpc^^ bin 
is somewhat high due to noise. Since this variance appears in the denominator of Eq. (25), it brings the simulation 
correlation coefficients down, thus explaining the slight disagreement with the analytic result. The other deviation is 
that the correlation coefficients relative to the bins at wave number (fc « 0.1/iMpc~^ and k w 0.15/iMpc~^) are higher 
than expected for Case 1 at low rcdshift. Whereas they should theoretically not have a beat coupling contribution, 
these correlation coefficients behave as if they do get such a contribution similar to the one in the scenario with beat 
coupling and the local average effect (Case 3). This may simply be due to non-linear effects beyond the order included 
in our theoretical expressions. Other than this, the agreement is very good. 

To conclude this section, we use our analytic expressions to quantify the importance of non-linear corrections to the 
covariance matrix on a range of scales relevant to large scale structure surveys. In the left panel of Fig. 5, we show 
the theory based variance, at redshift zero, taking into account all non- linear effects (green), and taking into account 
everything except for the beat coupling and local average effects (red). The figure is the same as in Fig. 1, except that 
the focus is on the theory curves in the range k = — 0.2ft,/Mpc. We see that the full non-linear corrections change 
the variance at the 25% level at fc = 0.15/i/Mpc and by ^ 50% at fc = 0.2/i/Mpc. For comparison, we also show 
in dashed blue the variance if we do not take into account the reduction in excess variance due to the local average 
effect. Neglecting to take the latter effect into account would lead to a gross overestimate of the non- linear variance. 

To quantify the effect of not only the non-linear corrections to the diagonal of the covariance matrix, but also of 
the non-diagonal elements, wc next calculate the squared signal-to-noise ratio, 

(S/Nfik^,^)^ Cll^^P{h)P{k,). (26) 

The sum here is over all pairs of power spectrum bins with both central fc values below fcmax. For P(fci), we use 
the (non-linear) power spectrum in the fiducial model, so that S/N can be thought of as a detection (or amplitude) 
signal-to-noise. The top figure of the right panel of Fig. 5 shows this statistic using a (diagonal) linear covariance 
matrix (based on mode counting; black curve), the full non-linear matrix (green), and the non-linear matrix with 
the off-diagonal elements set to zero (dashed green). The bottom figure shows the relative difference with the linear 
signal-to-noise squared. Compared to the linear case, the non-linear, but diagonal covariance matrix indeed decreases 
the signal-to-noise, as expected from the increase in variance seen in the right panel. However, the off-diagonal 
elements have a significantly stronger effect and decrease the signal-to-noise even further. The signal-to-noisc gives an 
idea of how much Fisher matrix elements in a parameter forecast, or values in a Monte Carlo chain, are affected 
by the non-linear corrections to the covariance matrix. This means that when a power spectrum study, whether with 
real data or in a Fisher forecast, includes k modes well into the non-linear regime, it is not enough to just include 
non-linear corrections into the variance of the power spectrum. Instead, the full non-linear covariance matrix needs 
to be taken into account. 
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FIG. 5: Left Panel: Power spectrum variance relative to variance based on linear power spectrum and mode counting {2P{^i^/Nk 
with Nk the number of modes per bin) for redshift z = 0.5. The red curve represents the case where no modes larger than 
the survey are present (but other non-linear contributions are included). The most realistic case is depicted in green and 
includes the beat coupling and local average effects. Non-linear effects cause an increase of variance of up to 50% on scales 
k < 0.2/i/Mpc. The dashed blue curve shows the variance when the local average effect is not taken into account. Right Panel: 
Cumulative signal-to-noise squared in the amplitude of the non-linear power spectrum (see text for details) as a function of 
largest fc-bin included (top; bottom shows fractional difference relative to the linear matrix). The non- linear, off-diagonal terms 
in the covariance matrix give the strongest contribution to the decrease in signal-to-noise at large k. 



V. GENERAL SURVEY GEOMETRY 



So far, we have found it convenient to describe the statistics of the density field in terms of a discrete set of 
overdensity modes. Whereas this is exact when considering the modes of a full periodic box, this should be considered 
an effective description in the case of subboxes, justified by their simple geometry. In this section, we work out 
the formalism for a general survey geometry in terms of the continuum of Fourier modes that exist in an infinite 
universe. We closely follow the notation and results of [7], but will add to this the trispectrum contributions arising in 
perturbation theory. The discussion in this section will provide justification for the discrete description of the previous 
sections. One issue that we will pay particular attention to is the motivation for using the diagonal mode-counting 
expression for the disconnected part of the covariance matrix in Eq. (9). As we will see, in the FKP formalism, the 
window function correlates power spectrum estimators in different bins (even in the Gaussian case), and the mode 
counting expression is not accurate. We will explain why mode counting was justified in the previous sections, argue 
that for a realistic survey the FKP description including cross correlations is the relevant one, and finally test the 
latter description against simulations. 

We first define our Fourier convention as 

j^^'^-'-m. (27) 

The statistics of the continuum of Fourier modes are given by 

((5(k)(5(k')) = (27r)3p(fc)5^(k + k') 
((5(k)<5(k')(5(k")) = (27r)3B(k,k',k")5^(k + k' + k") 
(<5(k) 5(k') S{k") (5(k"'))c = (27r)3 T(k, k', k", k'") <5^(k + k' + k" + k'") 

(28) 

For a survey with background number density ri(x), consider then the weighted density field 

n(x) w{x.) (5(x) 



F(x) = ^ ^ G(x) <5(x), (29) 
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where w(x) is a weight function that can be chosen to maximize signal to noise. We will ignore shot noise in the 
following, but it is straightforward to include it in the Gaussian approximation (see FKP). The Fourier transform 
is given by i^(k) = {G * S){'k), where "*" indicates a convolution, and the power spectrum can be estimated by 
P(k) = |i^(k)|2, so that 

J ^P(fc')|G(k-k')r (30) 

The measured power spectrum is thus a weighted average of the true spectrum, with a weight function of width of 
order the fundamental mode of the survey, i.e. Afc tt/L if L is the typical scale of the survey. As a relevant example, 
the window function for a cubic box of side L (n = const, lu = 1 inside the box and zero outside) is given by 

G(k) = L3/2 jo (Lfc J2) jo {Lkyl2) jo {Lk,/2) , (31) 

with jo{x) = sinx/x the zeroth spherical Bessel function. If the true spectrum varies little across this range of scales, 
we get 

(P(k)) « Pik). (32) 



A. Covariances - disconnected (Gaussian) contribution 

We first consider the contribution to the covariance between FKP power spectrum estimators arising from products 
of two-point functions. This is the only contribution for a Gaussian density field and we can follow FKP for its 
description. 

We start from the two-point function 

(^^(k) F*(k')) = J ^ P{k") G(k - k") G*(k' - k") « P{k) Q(k' - k'), (33) 

where the second equality is true in the same limit where the (expectation value of the) power spectrum estimator 
equals the true spectrum, and (5(k) is the Fourier transform of the (normalized) squared window function, 

Q(x) = (34) 

J a-'xn^(x) w^(x) 

Hence, the covariance 

(<5P(k)(5P(k')) « |P(fc)g(k-k')P + (k' ^ -k'). (35) 
In FKP, the power spectrum in a bin i is given by 

T^P{k), (36) 



(37) 



where Vk^i is the fc-volumc of bin i, so that the covariance between (isotropic) bins is 



Because of the extended nature of the window function |Gp, there will thus be correlations between different bins. In 
the limit that the bin width is much larger than the width of this window function, these correlations are negligible 
and one can make a further simplification by integrating out |Gp. Using 

/^-"-W^'-' .V^- ,38, 



(27^)3 [/d3xn2(x)«;2(x; 

one ends up with 
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where ki is a typical mode in the i-th bin and we have further assumed that the power spectrum varies httle across a 
bin. This is the well know mode-counting result. The expression for the effective volume is further simplified in FKP 
by choosing an optimal weight function, but we will stick to general u'(x). Note that for our cubic box (or any other 
geometry with constant background number density and weighting), Vcs — V. 

Assuming that embedding our Lgub = 600/i~^Mpc cubed box in a i = 2400/i~"'^Mpc cubed periodic box is a good 
approximation to embedding it in an infinite universe with a continuum of Fourier modes, we can apply the above 
to our simulated scenario and find that for our bin width Afc = 0.01/iMpc^^, there should be significant correlations 
between bins and the mode-counting argument is not valid, in fact overpredicting the variances. We show this in 
figures 6-9, dashed lines, which we will discuss in more detail below. 

Why then was it correct to use the diagonal, mode-counting expression in the previous sections? The reason is 
that, in keeping with our effective periodic box description, our bin average only includes modes which are multiples 
of the subbox's fundamental mode, i.e. k = 27r/Lsubn with the components of n integers, 

A^^E^W- (40) 

This decorrelates the binned power spectrum estimators as the mode mixing kernel 

Q(k) = Jo (isubA:j2) Jo {L,,,^ky/2) jo {L,^bkj2) (41) 

vanishes for separations that are a non-zero multiple of 2-k / Lsuh- Since the kernel equals unity for zero separation, 
the covariance in this averaging scheme is given by the mode-mixing result, 

■' kei k'ej ■' kei 

This begs the question if a similar binning scheme can be applied to an actual survey, in order to decorrelate 
power spectrum estimates in different bins. Unfortunately, for a realistic survey, the geometry will be much more 
complicated than a simple cube, making this rather difficult. In practice, therefore, one would typically embed the 
survey in a much larger cubic volume (zero-padding the part not covered by the survey), apply a Fourier transform, 
estimate the power spectrum for each k on the grid by the embedding box (i.e. modulo 2'k/L with L the size of the 
large box), and finally average to obtain the binned spectrum. This is thus a much denser sampling than the one 
we applied previously and approaches the infinitely dense FKP bin average of Eq. (36). In a more realistic scenario 
therefore, mode counting would not be sufficient and the off diagonal covariances are significant. 

For this reason, it is useful to numerically test the mode mixing in the covariance matrix due to the window function 
as given in Eq. (37). We do this by again calculating the covariance matrix for our Lgub = 600/i~^Mpc subbox, but 
this time using a binned spectrum averaged over all multiples of the fundamental mode of the large L = 2400/i~^Mpc 
simulation box, thus increasing the sampling density by a factor of 4^^ = 64 and approaching the FKP bin average. In 
practice, we zero-pad the exterior of the subbox and then apply the Fourier transform to the full box. The results are 
shown in figures 6-9 and compared to the analytic expression (37). To model the inevitable non- linear effects, we 
add trispectrum contributions to the theory prediction, which we will describe in the next subsection. Even without 
these contributions however, we can already see from the linear regime that the FKP expression works very well and 
that indeed with the more realistic averaging scheme, the off diagonal covariances are considerable and the variances 
are reduced accordingly. 

To conclude this subsection, we briefly compare our beat coupling investigation to that of [25]. They too study the 
beat coupling effect by considering a subbox of a larger simulation volume (although in their case the subbox is only 
a factor 2^ = 8 smaller than that of the simulation, Lgub = 500/i~'^Mpc'^). They use two approaches for estimating 
the power spectrum. Their "zero-padding" treatment corresponds to the FKP-like method described in the preceding 
paragraphs, while the other approach is equivalent to the one we used consistently before this section. While they 
do not comment on the origin of the observed differences, it is reassuring that the simulation results in their Fig. (10) 
agree with our explanation and with our own simulation results. Interestingly, in [25], the excess covariance due to 
large modes is significantly smaller than expected based on beat coupling only. However, they use the local (subbox) 
average to calculate the overdensity. The lower variance can thus be explained very well by the local average affect 
discussed in this article. 



B. General Survey Geometry: Perturbation Theory 

The discussion in the previous section covers the linear regime where the underlying density field is to a good 
approximation Gaussian. Including non-linear effects, it still describes the disconnected contribution to the covariance 
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FIG. 6: Power spectrum variance relative to (linear) mode-counting variance at redshifts 2 = — 2 for Case 3, i.e. with both 
beat coupling and the local average effect included. Comparison is between variances in spectrum averaged over bins (of width 
Afc = 0.01/iMpc~^) with fine sampling (as in FKP, Eq. (40)), which has mode mixing due to the window function (magenta 
dots), and average over modes that are multiples of fundamental mode of subbox (Eq. (36)), which decorrelates the power 
spectrum estimator in bins (green dots). We also refer to the former case as the "zero padding" covariance matrix, as the power 
is estimated from simulations by using the full 2400/t~^Mpc simulation box to do the Fourier transform, but zero padding the 
region outside the 600^~^Mpc subbox. Dashed lines show theory predictions from Eq. (23) (green) and Eq. (48) (magenta). 
The theory covariance matrix thus successfully includes window function effects and non-linear effects. On large scales (small 
fc), the green curve and dots approach unity because the bin averaging scheme causes the variance to be given by mode counting 
and because non-linear effects vanish. The more realistic zero padding case however has significantly lower variances, but strong 
correlations between neighboring bins (even on linear scales) so that the total information content is the same. 
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FIG. 7: As Fig. 6, but here showing the correlation coefhcients pij = Cij / ^Cu Cjj of i-th bin at k on x-axis, relative to j-th 
bin at fc = 0.04 — 0.05/iMpc~^ . The theory predictions again agree quite well with the simulation results although more so for 
correlations relative to bins at larger k, as shown in Figs 8 and 9. Note in particular the large correlation between neighboring 
bins in the case where the bin average is based on fine sampling/zero padding (Eq. (36), magenta). This is due to mode mixing 
by the window function and is separate from the cross correlations due to non-linear evolution. It is absent when estimators 
are averaged using the sparse bin sampling (green), Eq. (40). 
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FIG. 8: As Fig. 7, but correlations relative to bin at A; = 0.09 - 0.1/iMpc 
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FIG. 9: As Fig. 7, but correlations relative to bin at fc = 0.14 - 0.15/tMpc 
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matrix provided that the non-linear power spectrum is used. However, additional terms are now needed as the 
covariance matrix receives contributions from the connected part of the four-point function. These terms were already 
included in the figures 6-9 (and were discussed in a simpler framework in section II). Here we discuss the details of 
the inclusion of the trispectrum terms for arbitrary window function. 

Including these terms, the covariance between single-mode estimators now becomes 

{SPik)SP{k')) = |p(fc)g(k-k')|' + (k'^-k') 

+ ^ ydVrf'q2dWq4<5^(qi -q2+q3-q4)G(qi)G*(q2)G(q3)G*(q4) 

X r(k-qi,-(k-q2),k'-q3,-(k'-q4)). (43) 

Note that this is the generalization of Eq. (8), which we used to describe the covariance in a cubic box. To leading 
order in perturbation theory, the trispectrum is given by^ (see [8]) 

r(ki, k2, k3, k4) = 4 [^^2(ki2, -ki) F2(ki2, -kg) P(fci) P(fci2) Pih) + perm.]-|-6 [i^3(ki, k2, kg) P(fci) P(fc2) P{h) + eye] , 

(44) 

with 12 distinct permutations for the first term, 3 cyclic permutations for the second, and where k^ = ki + kj. 
Inserting the arguments from Eq. (43) into this perturbation theory expression, and omitting the q^ dependence 
whenever q^ appears in an argument as a correction to a larger vector of order fc, fc', |k -I- k'|, etc, the trispectrum 
contribution becomes 

T (k-qi,-(k-q2),k'-q3,-(k'-q4)) = [4P(|k + k'|) (F2(k + k', -k) F(fc) + (k ^ k'))' + (k' ^ -k') 

+ 4P(fc) P(fc') P(|q2 - qil) (P2(q2 - qi, k) + P2(q2 - qi, -k)) (P2(q2 - qi,k') + P2(q2 - qi, -k')) 

+ 12 P(fc) P(fc') [P3(k, -k, k') P(fc) + (k o k')] (45) 

where we have used that the trispectrum arguments sum up to zero. The terms that are independent of the q.^'s can 
be pulled out of the integral in Eq. (43) so that the integral over mode mixing kernels simply gives a factor of V^^. 
The other terms only depend on the combination qi — q2 so that three of the four Fourier integrals can be carried 
out analytically and one remains, giving the result 



4P(|k + k'|) (P2(k + k',-k)P(A:) + (kok'))' + (k' ^ -k') 
12 P{k) P{k') [P3(k, -k, k') P(fc) + (k ^ k')] 

4P(fc)p(fc') (I (^igp(u) 



((5P(k) 5P(k')) = |P(fc)Q(k-k')|' + (k' ^ -k') 
1 

1 

^ KTff 

X y'^|gP(u)P(w) (P2(u,k)+P2(u,-k)) (P2(u,k')+P2(u,-k'))). (46) 

The final step towards a covariance matrix is to apply the bin average to Eq. (46), but for the trispectrum terms 
the subtle, averaging related effects described in the previous subsection are not important because the correlations 
from the trispectrum are not as narrow as those for the disconnected (or Gaussian) terms. The angle averaging can be 
done analytically for the last term (the beat coupling term) and has to be carried out numerically for the remaining 
trispectrum contributions. The final result is 



* Unlike in section II, but consistent with perturbation theory, we now immediately express perturbation theory quantities in terms of 
the non-linear power spectrum P and not P''". 
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where r'^(k, — k, k', — k') represents the second and thhd hnes of Eq. (46). The last term is the beat couphng term, 
which can now exphcitly be seen to be proportional to a weighted average of the power spectrum over large modes. The 
integral describing this average can be compared to that for the expectation value of the zero-mode power spectrum, 
Eq. (30). The only difference is that the latter is given in terms of G while the former is given in terms of Q (note that 
the normalization integral appearing in the beat coupling term is equal to one when Q is replaced by G). However, 
for a cubic box (with n = const, w = 1 inside the box), the two quantities are exactly equal and the average appearing 
in the beat coupling expression above can be replaced by the zero-mode power spectrum, thus justifying our use of 
P(5" in Eq. (15) in section II B. 

We will not rederive the correction due to the local average effect for the case of arbitrary geometry, but in analogy 
with the cubic subbox case, we will use the following expression for Case 3 in Figs 6-9: 



Note that for a varying background density, the estimate of the background density p(x) used in (5(x) ~ (/5(x) — 
p{x.))/p{x.) is not only affected by the effective zero mode, but also by smaller modes. These will also affect the 
covariance matrix, but we will not go into this effect here, as it is beyond the topic we set out to study. 



In this work, we have studied the effects of modes larger than the survey on the (dark matter) power spectrum 
covariance matrix. We have built an analytic description that includes the beat coupling effect of [21], but also the 
previously overlooked (at least in analytic studies) effect of large modes on the estimated average density p which 
enters the overdensity through 5 = [p — p) / p (the local average effect). We have confirmed this model and the role 
of individual contributions by comparing to covariance matrices obtained from N-body simulations. To study the 
role of super-survey modes in the simulations, we estimated the power spectrum from a subvolume embedded in a 
considerably larger simulation volume. 

We summarize our main results below: 

• We build a model based on perturbation theory for the matter covariance matrix that includes the effects of 
modes larger than the survey. For the variances, we find excellent agreement with simulations for k < 0.4/iMpc~^ 
(or larger) at z = 2 and for k < 0.2(0.15)ft-Mpc~"'^ at z = 0.5(0). Agreement is even better for the correlation 
coefficients, with our model predicting the correct coefficients for at least k < 0.4/iMpc^^ at all redshifts. 

• When isolated, the beat coupling effect from [21, 22], can indeed be described by the last term in Eq. (22), as 
shown by the blue points and curves of Figs 1-4. 

• In a more realistic approach, the local average effect needs to be taken into account as well. This has previously 
been overlooked in analytic studies and we derive its effect using perturbation theory, leading to Eq. (23). It 
reduces the covariance, leaving only 10% of the original beat coupling excess covariance, as shown by the green 
lines and points in Figs 1-4. This also explains the disagreement found between the beat coupling-only expression 
and simulations in [25]. We conclude that the beat coupling excess covariance is not as important as previously 
thought. 

• Eq. (48) gives the final result for the matter covariance matrix for arbitrary survey geometry, including not only 
the above mentioned effects, but also the correlations between neighboring power spectrum bins and the related 
reduced variance due to the survey's window function. It is depicted in Figs 6-9 and again agrees well with 
simulations. It can be used as a first step towards a covariance matrix for the galaxy (or other tracer's) power 
spectrum. 

In a real survey of large scale structure, the effects discussed in this paper are relevant because there are always modes 
larger than the survey volume. Our results are thus important for large scale structure surveys and in particular galaxy 
surveys, as we quantify the expected excess covariance due to these modes, as well as the more standard covariance 




(48) 



VI. SUMMARY AND DISCUSSION 
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contributions. With a complete description of the covariance matrix now available, it is also possible to study the 
cosmology dependence of the covariance matrix in an efficient manner. We refer to the Appendix for a first look into 
this issue. 

To build a complete covariance matrix for the galaxy power spectrum, one needs to include the effects of shot 
noise, galaxy bias and redshift space distortions. While the effect of shot noise is easy to incorporate within the FKP 
formalism (at least in the limit where the shot noise can be treated as Gaussian) , the other effects are more complicated 
and clearly beyond the scope of our paper. However, a very rough first approximation to a galaxy covariance matrix 
could be obtained by multiplying the dark matter matrix by the galaxy bias to the fourth power, including an angle 
averaged Kaiser factor (as in [25]) to account for redshift space distortions, and using FKP to include the effect of 
shot noise. 

In addition to the complications arising from observing galaxies (or other tracers) instead of dark matter, one also 
needs to be able to describe the covariance for a realistic survey geometry (not many surveys have a cubic footprint). 
For this reason, we presented in section V a full description of the model for arbitrary survey geometry, assuming 
the FKP estimator is used. By comparing to simulations, we showed that the expressions from [7] (with trispectrum 
terms added) can be used to accurately describe the mode mixing due to the survey window function and the resulting 
correlations between neighboring power spectrum bins. While section V does not present any completely new results 
relative to the previous sections and what is in the literature, its main purpose is to present a complete set of equations 
to describe the matter power spectrum for arbitrary survey geometry. This can then serve as a stepping stone towards 
a full galaxy covariance matrix. 
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Appendix A: Cosmology Dependence of Covariance Matrix 

One major advantage of the analytic expressions presented in this work, is that they allow for a quick estimate of 
the covariance matrix, especially compared to methods using N-body simulations. The ease with which covariance 
matrices can be calculated makes the analytic method perfectly suited for studying the cosmology dependence of 
the covariance matrix. A full study of this cosmology dependence would consider how constraints on cosmological 
parameters are affected and would quantify the error induced by ignoring the cosmology dependence. Such an 
investigation is beyond the scope of this article and we will leave it for future work. In this appendix, we will simply 
quantify how much the covariance matrix changes as we vary individual cosmological parameters. 

For simplicity, we again consider the covariance in the matter power spectrum, as estimated from a cubic volume 
with V = = (600/i~^Mpc)'^. We imagine the spectrum is "measured" at redshift z = 0.5 and include the smearing 
effect due to the window function that arises when the FKP estimator is used. In other words, we will consider the 
covariance matrix given by Eq. (48), with the power spectrum evaluated at z — 0.5. 

In practice, galaxy surveys typically measure the power spectrum relative to a fixed, fiducial background cosmology, 
see e.g. [35]. Therefore, when a Monte Garlo chain is run, the theoretical power spectrum at each point in parameter 
space needs to be rescaled to account for the effect of using the fiducial background cosmology as opposed to the 
actual background cosmology at that point in parameter space. Only after this rescaling can it be compared to the 
observed spectrum. For consistency, we therefore also rescale the covariance to the fiducial background cosmology, 

C/(Afc„ AA:,) = A-6c(Afc,/A AfcjM)|y=y,A3, (Al) 

where is the covariance matrix of the power spectrum estimator relative to the fiducial background cosmology, C 
the true covariance matrix given by Eq. (48) (evaluated at the actual volume V = Vf A'^ , with Vf the survey volume 
calculated in the fiducial cosmology), Aki the bin widths, and A = dv{z)/dy{z) the dilation factor (see [35]). 

Since the different terms in the covariance matrix scale either like the second or third power of the power spectrum, 
we expect a significant dependence on as and other parameters changing the overall normalization of the spectrum. 
The beat coupling/local average term is particularly sensitive to the power at A: ^ 2tt/L so that it should depend on 
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FIG. 10: Left Panel: Variance relative to linear, mode counting based variance as a function of cosmology. The black curve 
shows the fiducial cosmology, and the other curves depict the effect of varying each ACDM parameter by approximately twice 
its error bar expected from a large scale structure plus CMB measurement (see text, and Table I). Right Panel: The signal- 
to-noise squared in the detection/amplitude of the power spectrum as a function of the largest included mode k (main figure). 
The bottom inset shows the relative difference with the fiducial cosmology. Both panels show that the strongest parameter 
dependence is on as and ilm. 



any parameter that affects the power on very large scales. To make this more quantitative, we consider the ACDM 
fiducial cosmology described in section III and change each parameter p by a step Ap, one by one, keeping the 
remaining parameters fixed. For the step sizes, we choose Ap = 2ap, where ap is the parameter uncertainty from [6] 
(Table 3), obtained from combining the SDSS-II Data Release 7 [36] halo power spectrum with WMAP5 [37] cosmic 
microwave background (CMB) data. These step sizes are indicative of the relevant parameter range in a Markov 
Chain Monte Carlo (MCMC) analysis with power spectrum data. We show the fiducial parameters and their step 
sizes in Table I. 









(^8 


ris 


fiducial 


0.023 0.1323 


0.27 


0.79 


0.96 


step size 


0.00116 0.008 


0.038 


0.05 


0.026 



TABLE I: Fiducial values and step sizes used to test cosmology dependence of matter covariance matrix. 



In the left panel of figure 10, we show the effect on the variance of each parameter. As before, we normalize 
the variance by the variance based on mode counting and the linear power spectrum (as calculated in the fiducial 
cosmology). The units on the horizontal axis are [/i^/Mpc], where hf = 0.7 is the dimcnsionless Hubble parameter in 
the fiducial model. The variance in the fiducial model is given by the black curve. As expected, the largest variation 
is obtained when varying erg, causing a significant increase in variance. The effect of the other parameters is much 
smaller. 

The effect of the parameter variations on the correlation coefficients is always below |Apij| 0.05, with ujm 
decreasing the coefficients most, and CTg increasing them by the largest amount. Finally, to incorporate the properties 
of the full covariance matrix in a single statistic, we calculate the squared signal-to-noise ratio in the (non-linear) 
power spectrum amplitude as a function of fcmax, see Eq. (26). In a more thorough analysis, one could replace the 
power spectrum in the expression for (S/N)"^ by derivatives with respect to cosmological parameters in order to create 
a full Fisher matrix. 

The right panel of figure 10 depicts this signal-to-noise squared as a function of fcmax for the different cosmologies, 
with the bottom inset showing the relative difference with respect to {S/N)'^ in the fiducial model. We focus on the 
range fc = — 0.2 /ij/Mpc, as this is a more realistic range for a galaxy survey (due to strong non-linearities and shot 
noise on smaller scales), and because we have seen that our approach loses accuracy on smaller scales. Consistent with 
the picture arising from the left panel, we see again that erg has by far the largest effect, decreasing the signal-to-noise 
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by ^ 30% (with only a weak dependence of the relative change on fcmax)- The other parameters have a much more 
modest effect 10% for fcmax ~ 0.1/i/Mpc). We conclude that reasonable variations in cosmic parameters can cause 
~ 30% changes in the covariance matrix and thus in Ax^ values in an MCMC chain. 

Finally, we note that, for a galaxy survey, the effect of galaxy bias on the covariance matrix will be very important 
as well, and will look similar to the effect of erg discussed above. 
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